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ABSTRACT 

Techniques to improve the data quahty of interferometric radio observations are considered. 
Fundaments of fringe frequencies in the iiv-plane are discussed and filters are used to attenuate 
radio-frequency interference (RFI) and off-axis sources. Several new applications of filters are 
introduced and tested. A low-pass filter in time and frequency direction on single baseline data 
is successfully used to lower the noise in the area of interest and to remove sidelobes coming 
from unmodelled off-axis sources and RFI. Related side effects of data integration, averaging 
and gridding are analysed, and shown to be able to cause ghosts and an increase in noise, 
especially when using long baselines or interferometric elements that have a large field of 
view. A novel projected fringe low-pass filter is shown to be potentially useful for first order 
source separation. Initial tests show that the filters can be several factors faster compared to 
common source separation techniques such as peeling and a variant of peeling that is currently 
being tested on LOFAR observations called "demixed peeling". Further testing is required to 
support the performance of the filters. 

Key words: instrumentation: interferometers - methods: data analysis - techniques: interfer- 
ometric - radio continuum: general. 



1 INTRODUCTION 

For several decades, it has been a challenge to increase the dynamic 
range of images produced by interferometric radio telescopes. The 
raw sensitivity improvements and advanced understanding of cali- 
bration errors have pushed the limits on the dynamic range of mod- 
em telescopes to unprecedented levels (Smirnov 2011). The final 
dynamic range is constrained by the celestial field being observed, 
the efficiency of the telescope's hardware and the time spent ob- 
serving. However, this theoretical dynamic range is limited further 
by imprecise models of instrumental effects and celestial sources 
used in the data reduction process, as well as by the quality of the 
radio environment. 

The noise level in the final result of an observation can be 
set by several phenomena. In the ideal case, the noise level equals 
the thermal sky noise level, and the detection of sources or other 
features is limited by this noise level only. An image can also be 
limited by confusion noise when it does not provide enough res- 
olution to distinguish sources. Sidelobes provide a third type of 
noise, which are generated by the point spread function (PSF) of 
the instrument, that convolve strong sources that are in or outside 
the field of interest. Finally, radio-frequency interference (RFI) can 
add additional noise to the final result of an observation. In this 
paper, we will aim at suppressing noise coming from RFI and side- 



lobe noise coming from off-axis sources, using similar techniques 
based on fringe theory. 

Because we address two problems at once, we will introduce 
both problems individually. In the following subsection, we will 
introduce the problem of RFI and describe current techniques to 
deal with it. Thereafter, we will introduce the concerns of off-axis 
sources and approaches to deal with those as well. 

1.1 Radio-frequency interference 

While technical advances gave rise to better telescopes, different 
technical advances have ironically decreased the quality of the ra- 
dio environment for radio astronomy. A potential problem that lim- 
its the effective dynamic range of modern telescopes such as LO- 
FAR, the WSRT, the GMRT, the ATCA and the VLA, is radio- 
frequency interference (RFI). Fortunately, practically all RFI inter- 
feres within a limited amount of time or frequency channels, and 
can be flagged automatically in post-correlation. In Offringa et aL| 
P010a| ), the SumThreshold algorithm is described and is proven to 
be very accurate for that purpose. Further implementation of the 
method into the LOFAR pipeline has shown excellent results ( |Of-| 
[fringa et al.|20T0 b ). 

Although reasonably strong temporal and spectral RFI can 
successfully be removed by flagging, it is not always a satisfactory 
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solution. Sporadic continuous broad-band RFI for example poses a 
potential problem, since this type of RFI can not be removed by 
flagging. Doing so might affect considerable parts of the obser- 
vation, potentially throwing away too much of the data. | Athreya| 
P009| ) has shown that the GMRT suffers from this type of RFI at 
low frequencies, for example caused by high voltage lines. Athreya 
([2009 ) describes a method to remove this kind of RFI based on 
fringe fitting of RFI. This approa ch has been recently implemented 
in Alpj^(K ogan & Owen|2010] ). This method will be analysed in 
^ Most other telescopes do not report such severe broad-band 
RFI: LOFAR, although build in a populated area, shows very lit- 
tle of this kind of RFI in the currently finished stations (Offringa] 
|& de Bruyn|2011| ) and (E)VLA interference reports also mention 
spectral RFI affecting a few channels, but no broad-band RFI, al- 
though low frequency causes more problems ( Chandler & Perley| 
|2010| §4.6). Nevertheless, when approaching the thermal noise on 
low frequencies, such as LOFAR will do in the future, faint RFI 
might show up. The fringe fitting method is not so well applica- 
ble in these cases, because such RFI will be below the noise. By 
removing a spatial frequency component from (white) noise domi- 
nated data, a component from the noise will be removed instead of 
removing actual RFI. Work has been done to apply post-correlation 
RFI removal techniques for the (E)VLA, by ways of calibrating and 
removing the RFI source ( [Lane et al.|2005| , but this method is te- 
dious and requires the RFI to be reasonably stable. 

Another solution for removing continuous RFI is spatial fil- 
tering by eigenvalue decomposition fLeshem et a l.|2000"l| Smolders] 
|& Hamp son 2002 , |Ellingson & Hampson 2002 ), which disentan- 
gles the contribution of sources from different directions, and sub- 
sequently removes the contributions from the direction of interfer- 
ence. Recently, this was implemented for the Parkes multibeam re- 
ceiver ( |Kocz et al.|2010| ). However, the requirement of specialized 
hardware and/or having to configure the filter before correlation is 
a major disadvantage of spatial filtering techniques, in the context 
of interferometers. The latter requires the configuration to be fixed 
before the observation in most cases. This makes it hard to react 
to unanticipated RFI, and impossible to change the filter after ob- 
serving if the filter has not worked correctly. RFI is often not stable 
enough to be removed during post-correlation processing. 

Another technique for removing sporadic continuous RFI has 
been introduced in |Pen et al.|p009] ), which decomposes the time 
frequency data with a singular value decomposition (SVD). This 
method however was shown in [Offringa et al.| |2010a ) to poten- 
tially alter the astronomical data, making the method less attractive 
to use for data reduction without further research. InlBrig gs et al.| 
( |2000| ), the RFI is subtracted from the data after correlation by the 
use of a reference signal. Unfortunately, such a reference signal is 
not always available or practical to implement. 

1.2 Off- axis sources 

Signals from off-axis sources received in the sidelobes, like RFI, 
decrease the dynamic range of observations, or might even cause 
calibration to fail. New wide-field telescopes such as LOFAR see a 
large area of the full sky, and always have a few strong sources 
in their sidelobes. Examples of such sources are Cassiopeia A, 
Cygnus A and the Sun. These sources are often not of interest, but 
have to be removed accurately. 

^ AiPS is the Astronomical Image Processing System 
( |http : //aips . nrao . edu/| 



A common method to deal with off-axis sources is peel- 
ing (|Noordam| 2004[ |Intema et al.|2009| ). Peeling is iterative, and 
changes the phase centre towards the source, optionally averages 
in time and frequency to suppress other sources, and self-calibrates 
and subtracts the source. This method has shown good results, but 
is very computational intensive — too intensive to use by default 
on high resolution telescopes such as LOFAR. Demixed peeling 
is a variation on normal peeling, that is currently being tested for 
LOFAR observations. However, early results show similar compu- 
tational requirements when the same removal quality is required 
peffs et al.|2006l ). 

Finally, in |Parsons & Backer] ( [20091 ) a delay-delay rate (DDR) 
filter is proposed that disentangles the flux contribution into the dif- 
ferent sky facets they originate from. The DDR-filter was used by 
Parsons & Backer for first order calibration, but the idea of such a 
filter is also attractive for application in a later stage and over longer 
timescales, because the filter can be applied on post-correlated data 
without additional hardware. It is however unclear how accurate the 
filter will be for off-axis source removal. We will propose related 
filters, while trying to increase its application and accuracy. 



1.3 Outline 

In this paper we will describe and analyse new methods for filtering 
both RFI and off-axis sources, with the ultimate goal of reaching 
lower noise levels. We will start by analysing Athreya' s fringe fit- 
ting method in ^and describe why it is insufficient for e.g. LOFAR 
observations. In ^ several new methods will be introduced and 
analysed with the help of simulations. We will test our filtering ap- 
proaches in ^on a WSRT dataset at a frequency of about 140 MHz 
of the field centred on the radio galaxy B 1834-1-62 ( |Schoenmak-| 
|ers et al.||2000| ). At this low frequency, the WSRT is sensitive to 
very bright sources like Cygnus A and Cassiopeia A ( |De Bruyn &| 
Bernardi 2009 ), which despite their large angular distance are not 
sufficiently attenuated by the primary beam. They therefore gener- 
ate intense spurious sidelobes across the target field of view. We 
will discuss the results in ^ where we will also discuss how time 
or frequency averaging and gridding may effect off-axis sources or 
RFI. Finally, we will draw conclusions based on our findings in ^ 



2 ANALYSIS OF THE FRINGE FILTERING METHOD 
2.1 Removing constant RFI 

|Athreya| ( |2009| ) describes how geometrically stationary RFI can be 
removed from an observation by fitting out a sinusoid with a fre- 
quency opposite to the natural fringe rate. A stationary earth-bound 
RFI source receives a fringe rate opposite to the applied fringe stop- 
ping rate. Therefore, one can estimate its contribution. The natural 
fringe rate is given by: 

dw{t) 

= —ujEu{t) cos S, (1) 

with t the sidereal time, uje = I rotation/day, the rotation speed of 
the earth, u(t) the component representing the standard u position 
of the baseline in the i^^;-plane, w{t) the standard i(;-component 
representing the applied phase delay and S the declination of the 
phase centre. When a baseline is orthogonal to the direction of the 
phase centre, up (t) is zero. A stationary source of RFI contributes 
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(a) Fit with constant fringe rate, amplitude found = 12.8 
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(b) Using fringe count, amplitude found =14.7 



Figure 1. Comparison of fitting methods using simulated data: the original amplitude of the source is 16. Only the shown data is used for the fit. Using a 
constant fringe speed (left panel) over this range produces a somewhat less accurate fit compared to using the fringe count for each sample in the fit (right 
panel). The x-axis is in time steps of 15 seconds from the start of the (simulated) observation. At time step 1570, the simulated baseline is orthogonal to the 
direction linking the target source and the phase centre and z/i?=0. Hence, the fringe speed changes significantly over the displayed time range, which can be 
seen by the somewhat elongated fringes near the right. 



to a correlation in the form of the complex function 

RFI(t) =^e"'"^*, (2) 

with A the complex amplitude of the RFI at time t. The 2ti term is 
absorbed in z^f, such that its value is in radians/time unit. This am- 
plitude is initially assumed to be constant over some period [to , t^;] , 
and z/i? is assumed not to change over this time interval. It is then 
possible to estimate A by performing a least square fit between the 
complex function V{t), representing the observed visibilities, and 
the RFI signal by minimizing the error function 




to 

Minimization of e{A) results in 

A^ j V{t)e''^'dt, (4) 
to 

which corresponds to ^ = F{i'f), the frequency component vf 
of the Fourier transform T ofV over the time interval. Therefore, 
removing a Fourier component of a signal can be implemented as a 
standard frequency filter. Equation (|2| corresponds to a single com- 
ponent of the delay-rate (DR) transform, creating a symmetry with 
the DDR filter proposed in Parsons & Backer (2009 ). An exam- 
ple of the application of Equation ^ on simulated data is given in 
Fig.[Ta| The two plots show the result of fitting a sinusoidal function 
to simulated data. We simulated a WSRT interferometer, correlat- 
ing antennae RTO and RT5: a 720m baseline. A single channel is 
simulated with a frequency of 147 MHz. The simulated observa- 
tion has eight sources, seven of which are faint and in the primary 
beam, while the last source simulates an interfering source that is 
four times stronger. This off-axis source generates a visibility am- 
plitude of 16 and is a 40° from the phase centre, hence far from the 
other sources. 

Since changes slowly with time. Equation ([4]) will become 



inaccurate when increasing the time interval. Additionally, it can 
not be calculated near = 0. By observing that the number of 
wavelengths of delay caused by the geometrical delay corresponds 
to the number of rotations applied on the visibilities, we can re- 
place fpt by w{t) — w{to), where w is the applied phase delay in 
radians/time unit as function of time. As w{to) causes a constant 
phase shift, it can be absorbed in A. By substituting i/pt with w{t) 
in Equation ([4]), we get a more accurate solution for A: 



tE 




to 



An example of such a fit is given in Fig. [Tb] As long as the am- 
plitude of the RFI source remains constant, this allows successful 
removal of the source when up ^ 0. As is visualized by Fig. [2] 
it removes the strong source in the example without unwanted side 
effects on the area of interest. 

2.2 Removing variable RFI 

With the algorithm presented by Athreya, the RFI source is not 
only allowed to differ between baselines, but also in time. Since the 
beam rarely follows the RFI source, it is likely that the gain towards 
the RFI source will change. Athreya proposes tiling of the data, 
making separate fits on each tile, where each tile is approximately 
the size of a fringe. However, tiling the data and performing fits on 
each tile causes instability near the borders of the tiles. 

A more accurate way is to perform individual fits for each 
sample, sliding a kernel of weights over the data that are used to 
perform the fit. Two trivial suggestions for a weighting function 
are the rectangular function and the Gaussian function. A rectan- 
gular function would result in a sliding window method, which has 
implementational advantages. However, a rectangular function pro- 
duces a sine response in delay space. Therefore, the fit will be af- 
fected by any other frequency in the data set that corresponds to 
non-zero values in the sine function, which undesirably would re- 
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(a) Original 



(b) Fringe fitting applied 



(c) Difference 



Figure 2. These images show the application of a fringe filter that takes out a hypothetical source with a constant amplitude (Equation j5j). The same 720m 
WSRT baseline and set-up as in Fig. [T] was simulated and imaged without deconvolving. The image in the left panel is the result of imaging without any 
filtering. The middle panel shows the result after application of the filter, while the right image shows the difference. The filter removes the source up to the 
sidelobe confusion noise of the other sources, which is over three orders of magnitude. The residual shows that it does not affect the sources of interest, again 
up to at least three orders of magnitude. This simulated situation is only hypothetical, since it is unlikely that the received power of distant sources remains 
constant. 



move part of the signal of interest. A Gaussian kernel would local- 
ize the frequency response somewhat better. A larger kernel or tile 
size would decrease the frequency response to other frequencies, 
but in order to remove the RFI it would require the received gain of 
the RFI to change less quickly. 

Allowing the amplitude to change in time creates spirals in the 
complex plane. This kind of fitting has recently been implemented 
in the AIPS astronomical package as described by ^Kogan & Owen| 
( [20T0l ). 



2.3 Generalization of the fringe fitting method 

Up to now, the use of the method has been limited to the removal 
of a single (RFI) source that behaves like a point source at the 
celestial pole. It is common practice to peel and/or calibrate for 
sources that are outside the area of interest, because they need to be 
taken out carefully in order to avoid additional sidelobe confusion 
noise. In such a case, the off-axis source is similar to static RFI: the 
source itself is not of interest, but has to be taken out for calibra- 
tion and imaging the field accurately. For this purpose, the fringe 
fitting method can be generalized to remove any point source. This 
requires a small change to Equation |5]), which now becomes: 



A 



tE 



i(w(t)-ws(t)) 



dt. 



(6) 



Here, w{t) is the standard w;-component in the uvw domain as 
before, while ws(t) is the i(;-component for an observation phase 
centred on the source to be removed. While the process is eas- 
ier and faster than normal off-axis source calibration or peeling, in 
practice it will be of little use: it neglects information present in 
polarizations, as defined by the measurement equation (H amaker] 
|et al.|[7996| , and neglects the relations between baselines. Ad- 
vanced calibration algorithms such as the SAGE calibration tech- 
nique ( [Yatawatta et al.|2009{ [Kazemi et al.|201ll ) solve for source 
parameters by combining this information at once, and will in gen- 
eral be more accurate, as long as the source is (coherently) seen in 
multiple polarizations or antennas. 



/ Point source 
in image plane 



Direction of source 
in image plane 




as(t^ 



Figure 3. Cartoon showing how a source in the image plane contributes 
fringes in the uv-p\me. The further the source is from the phase centre (ori- 
gin), the faster the fringe. Function as(t) is the angle between the direction 
of the source and the direction of a specific point in the uv-track as a func- 
tion of time. The smaller q;^, the faster the fringe speed in the track at that 
point. 



3 NOVEL FILTERING TECHNIQUES 

For high dynamic range, the source removal techniques as anal- 
ysed in the previous section might not always suffice: the fringe 
fitting procedure can only remove a single unresolved source at a 
time. Also, since the fitting window has to be reasonably small, 
the fit will be slightly affected by the contribution of other sources. 
Therefore, the source has to be strong to be able to remove it, al- 
though the absolute error made will not depend on the strength of 
the source. 

In the following sections, we will present several filters that 
are aimed to work when the fringe filter does not suffice. The key 
issues that these filter techniques share, are that they do not perform 
fitting on windows, but use the full data at once. They also remove 
high-frequency Fourier components that do not correspond with the 
fringe frequencies of sources of interest. 



3.1 A low-pass filter in time domain 

The visibility of a single point source with strength Iim and coor- 
dinates (/, m) is given by 



(7) 
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Remaining sidelobe 




Figure 4. Applying the low-pass filter on several baselines will filter parts of 
sources that exceed the frequency limit. For a particular source, this corre- 
sponds with multiplying the source with a hourglass shape in the wu-plane 
(left panel). Because of this multiplication, the sidelobes of the source in 
image plane (right panel) will be, relative to the phase centre, filtered in 
tangential direction. Sidelobes in radial direction will remain. 

Define d — (u^v^w) and I — {l,m,n). Since the source Iim is 
real, the phase of V is given by 

0(d) = 27rd • I. (8) 

The property that will be used in the filtering technique, is the im- 
plication of this formula that sources with large \l\, i.e., that are far 
away from the phase centre, have a high fringe speed in the uv- 
plane. 

Without loss of generality, we assume that our interferometer 
has a configuration such that its corresponding i^i;-track is a circle 
that is centred on the uvw-ongin. This only occurs for an East- 
West Interferometer such as the WSRT. However, the technique can 
be straight forwardly extended to other interferometers that create 
possible elliptic tracks that might not be centred on the origin. In 
the assumed case, the uv-plmie position d will be a function of 
time but have a constant size. If a time-sorted sequence of observed 
samples of a single correlation is considered, its fringe frequency 
is given by 

iysit) = ^ = \d\\ls\cosas{t), (9) 

where us (t) is the fringe speed in fringes per second at time t for 
source S, \d\ is the radius of the i^v-track, \ls\ is the distance of 
S to the phase centre and as (t) is the angle between the i^'i;-track 
and the line through S and the phase centre as drawn in Fig. [3] The 
fringe speed will be maximal at points where the corresponding 
i^^;-track is parallel to the direction of the source, and zero when the 
source direction and i^i; -track are orthogonal. The maximal fringe 
speed produced by a source is proportional to the distance between 
the source and the phase centre: i^sit) oc \ls\- 

We will now consider low-pass filtering of the time-sorted vis- 
ibility data with a filter frequency uf, specified in fringes per wave- 
length. Such a filter will have the following two properties: First, 
sources with Vt : iys{t)/\d\ < uf, will never be filtered. In image 
plane, the area corresponding to i/s{t)/\d\ < z/f is a circle that 
is centred on the phase centre. The fringe speed in the i^^;-track is 
translation independent, hence it is not necessary for the track to be 
centred on the origin. In case the i^^;-track is an ellipse, the filter- 
ing area will be an ellipse as well, but we will continue to assume 
circularity. Second, sources outside the circle will be filtered dur- 
ing the periods in which us{t)/\d\ > vf- The differential start and 
end angle, respectively as and a|, at which a source will enter the 



filtered area are given by 

as = arccos ——r , 
\ls\ 

as — — arccos ——. (10) 
\ls\ 

The area filtered is independent of the baseline length because z/f is 
specified in fringes per wavelength. For a single baseline, the filter 
ratio can be calculated with {a% — aj) /tt. Consequently, in an 
array with N baselines with different sizes, the fraction of samples 
in which the source is filtered is given by 

= 1 arccos-—, (II) 

TT \ls\ 

which is therefore the total attenuation of the source by the filter. 

Although we have shown with Equation (iT) that the total at- 
tenuation of a source is known, the shape of the area that is filtered 
is important as well, as that defines the shape of the sidelobes. The 
effect of low-pass filtering is sketched in Fig.|4] the filter removes 
the source fringes at two symmetric radial areas in the i^t'-plane. 
Subsequently, the application of this filter can be seen as an ad- 
ditional multiplication of the source in the uv-plmiQ. Instead of a 
convolution with the nominal point spread function (PSF), sources 
in the image plane are convolved with a partly attenuated PSF. The 
side lobes that the source would normally have are not filtered in 
the direction of the phase centre, and can still increase the noise in 
the area of interest. This effect can be seen in Fig. [5] 

Although this filter does not directly suppress confusion noise, 
it does filter high frequencies that can increase aliasing effects dur- 
ing averaging or gridding ( §5.2| ). A more sophisticated filter will be 
presented in the next section, which utilizes the same theory about 
the fringe speed of sources. 

3.2 A projected fringe low-pass filter in time domain 

As was shown in Section [TT] in order to remove the side lobes 
of an interfering source from the area of interest successfully, the 
interferer has to be filtered over the entire length of the observation. 
We will now introduce a filter with the purpose of filtering out all 
sources in a certain direction beyond a minimum distance from the 
phase centre. 

The first step of the filter is to make the speed of fringes, com- 
ing from any source from a specific direction a d , constant in the 
time direction. This is done by rotating the uv-plmie such that the 
fringes are parallel to the v-axis, and subsequently projecting the 
samples from the track onto the v-axis, thereby stretching the high- 
frequency fringes and pushing together the low-frequency fringes 
from sources from direction an- Fig. [6] visualizes the transforma- 
tion. At each point on the i^i;-track given by an angle a{t), the 
fringe frequency us (t) of a source at time t is multiplied by a factor 
due to the projection, resulting in a new fringe frequency z^projected 
at angle a{t) on the circle, given by 

i^projected — //.\ \- y^-^J 

cos [a[t) — ao) 

By substituting the definition of iys{t) from Equation ^ into 
this equation for a single source in the direction of the filter, i.e., 
as{t) = a{t) — aD, the result IS Z^projected = |d| |Zs|. Hence, the 
fringe speed becomes independent of time. Sources from other di- 
rections, however, will not become constant. 
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Figure 5. Application of a low-pass filter in the time domain ( |3.1) . The source has been attenuated by filtering (left panel), but some of the sidelobes have not 
been removed. This is because the fringe rate of the source does not always exceed the filtering frequency. The middle panel shows what has been removed 
and confirms that the sources of interest have not been attenuated (up to the 100 times lower noise level), the right panel shows with high contrast what has not 
been removed from the source. Note the different intensity scales. 



Direction of source 




(a) Fringe contribution of a source (b) Rotation (c) Projection 



Figure 6. Creating a constant fringe rate towards a single direction. Panel[(a)] A source with a certain direction from the origin in the image plane will cause 
a fringe in the uv-p\me corresponding to that direction. Panel |(b)| Rotating the direction of the source onto the -u-axis will align its fringe with that axis. 
Panel I (c) I Projection of the sample track onto the v-axis will make any source in the direction of rotation have a constant fringe rate. 



12 



10 



-1000 -500 500 1000 
Fourier component index 

Figure 7. Fourier transform of a -track that was rotated and projected, 
such that sources in a certain direction have a constant fringe speed. The 
model of Fig. [5] was used. Most of the contribution of sources near the 
centre collect near Fourier component index zero, while the contribution 
of the off-axis source shows up as a peak at an index away from zero. 



An example of this effect is shown in Fig.|7] which shows the 
Fourier transform of a projected i^i;-track. The model of Fig.[2a|was 



used as input. The projection is towards the direction of the strong 
source in the bottom. This source shows up as an isolated feature 
away from Fourier component index zero, because this source lies 
furthest away from the phase centre. Although the power of this 
source peaks in one component, it is distributed over several Fourier 
components, because the time series is finite. Therefore, the point is 
convolved with the Fourier transform of a windowing function. The 
sources near the phase centre collect at component indices around 
zero. 

By performing a low-pass filter with frequency jyp on the 
projected samples, we will remove fringes from sources at time 
t ^ [to; te] for which 



lis 



COS as{t) 



cos (a(t) 



(13) 



holds. 



Fig. [8] visualizes the application of the filter. Its effect can be 
summarized by these three characteristics: (A) any sources at di- 
rection a D that are further away than the limiting distance corre- 
sponding to Up will completely be removed; (B) sources at direc- 
tion aD within the limiting distance will not be removed at all; 
and (C) any sources from directions other than a d will neither be 
removed completely nor stay untouched completely. The latter is 
because the denominator and the numerator in Equation |T3] ) will 
have zero crossings at different t. Consequently, the left term in 
Equation \13\ will become large when the denominator is near zero. 
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(a) Applied 



(b) Residual 



(c) Difference with model 



Figure 8. Application of the projected fringe low-pass filter ( g3.2} on simulated data. The projected fringe low-pass filter nulls a single direction starting at a 
certain distance, but does not preserve the phase centre well. In this simulation, the off-axis source has been removed completely up to the noise, two orders of 
magnitude lower. In |(a)| the filter is applied and the top source is removed. Panel |(b)| shows what has been removed from the image, while |(c)| shows what has 
been removed from the area of interest. 




Figure 9. Visualization of the first component in a one-dimensional 
CLEAN of the plot in Fig.|7] 

While incomplete filtering of sources in some directions that 
are not of interest is not very problematic, it is impractical that the 
only sources for which absolute preservation can be guaranteed, 
are sources that lie on the line going through the phase centre in 
the direction of the applied rotation. In the next subsection, we will 
present modifications that will solve this issue. 

Despite this complication, this method might still be usable in 
practice. According to Equation |T3] ), the fringes of sources will all 
be filtered around the same angle a(t) in the uv plane. This direc- 
tion is known, and the area in the uv plane that is affected is there- 
fore known. Samples in this area can be removed from the data, 
causing a small loss of data. However, the source will successfully 
be removed without side effects. 

3.3 The iterative projected fringe filter in time domain 

The projected fringe frequency of a on-axis source can exceed the 
filtering frequency when as{t) ^ an, i.e., when the uv-track is 
near parallel to the applied direction of the filter. To create an area 
of unfiltered sources in the image plane, one can leave this range 
out of the filter. This however, would create artefacts similar to the 



Figure 10. In red, showing the sum of the first hundred components re- 
moved by the deconvolution and in green, showing the residuals that contain 
the data for the area of interest. In the Fourier transform similar to Fig. [7] 
^filter part of the data around as{t) ~ was left out to make sure no 
sources in the area of interest map to higher components. 



low-pass filter of §3.1| and would still not improve the dynamic 
range in the area of interest. 

A solution is to perform a Fourier transform only on the part 
of the projected samples at which \as{t) — anl > rjp, for some 
small angle rjF, and use a deconvolution method to extrapolate 
the found frequencies to the area that has been left out. A one- 
dimensional CLEAN on the fringe spectrum can be used to remove 
and extrapolate fringes, taking fringes out one by one. Altogether, 
such a filter removes sources from a single direction a/:) at a dis- 
tance corresponding io and create a rectangular area around 
the phase centre which will be preserved. The width of this area 
is given by 

i^{i^F,r]F) = 1^ |sinryF| • (14) 

Off-axis sources from directions other than an will be partially 
removed and sources of interest will be fully preserved. We will 
discuss the results of practical application of this filter in ^ 



© 2012 RAS, MNRAS 000,[l]{T8] 



8 A.R. Ojfringa et al. 



-30° -20° -10° 0° 

(a) Applied 



-30° -20° -10° 0° 

(b) Residual 



-30° -20° -10° 0° 10° 20° 

(c) Difference with model 



Figure 11. Application of the iterative projected fringe filter ( g3.3) on a single simulated baseline of 720 m as in Fig. [2] The filter was aimed at the source in 
the bottom and iteratively removes fringes with high frequency. A value of 77fiiter = 0.2 was used to preserve all of the centre sources, and 100 one-dimensional 
CLEAN iterations were performed in the projected fringe spectrum domain. Although this has attenuated the source without needing a model of the source, 
the sidelobes in the direction of the phase centre still remain. 



Fig. |9] visualizes the Fourier transform of the first component 
that will be removed by a one-dimensional CLEAN on the plot in 
Fig. [7] In the Fourier transform, ?7fiiter part of the data was left out. 
Because of the finite time domain, the power in a single component 
is convolved with a function formed by the windowing function, 
which also depends on the angle between the source and the filter 
direction. Intuitively, one can think of this as the shape of the PSF 
in the projected fringe spectrum domain of a single baseline. 75 per 
cent of the power in the highest component are selected for subtrac- 
tion in each iteration. Figs. [T0| and [TT] show the resulting projected 
fringe domain and image domain respectively, after applying the 
iterative fringe filter with 100 iterations. 



3.4 Filtering in frequency direction 

The filters that have been presented so far, have been applied in 
the time domain of correlations from a single baseline. If an inter- 
ferometer observes several frequency channels over some limited 
bandwidth, a logical extension is to filter in frequency direction. 
The samples from different frequencies in the same baseline at the 
same time form a straight line in the i^^;-plane. A source S produces 
a fringe speed /xs in frequency direction given by 

/is(t,A) = \d{X)\\ls\smas{t), (15) 

and|d(A)|^f 

A low-pass filter in the frequency direction removes fringes of 
off-axis sources at which /is(t, A) < /x/. In contrast to filtering in 
time, the situation differs on some points: 

• The use of the sin function in Equation |T5] ) implies that 
sources produce a high fringe rate in frequency direction when the 
uv-trsLck is orthogonal to the source direction in the image plane. 
The result is that the source sidelobes in direction of the phase cen- 
tre, which is the area of interest, will be removed. Therefore, a low- 
pass filter in frequency direction would complement a filter in time 
direction, which depends on the cosine of the source angle and the 
uv-track (Equation (|9])). Therefore, the part that is not filtered by the 
latter can be further attenuated with a frequency direction low-pass 
filter. 

• While most radio sources are constant over the observation 
time, they vary over frequency. Low-pass filtering in frequency 



would low-pass filter the sources variation over frequency. Be- 
cause the primary beam is smaller at higher frequencies, an off-axis 
source can have a steep apparent spectral index. 

• In the frequency direction, the number of fringes is limited 
by the observing bandwidth, and the bandwidth might be limited 
such that the fringes of a source rotate too little for filtering. For 
example, if a bandwidth-frequency ratio of 2.5 MHz/ 100 MHz is 
assumed for a 100 m baseline (approximately the shortest WSRT 
baseline observing with a single band), a source needs to be at a 
distance of about 8° from the phase centre to create a single fringe 
within the bandwidth. 

Due to these characteristics, the use of a frequency filter can 
complement a low-pass filter in time, but might be limited to the 
longer baselines or large filter radii. To be effective, sufficient band- 
width is required. The available bandwidth for filtering might be 
further limited if the apparent spectral indices of the off- sources 
are steep. 



4 PRACTICAL APPLICATIONS 

Several filters for off-axis sources were described in the previous 
chapters. Fig. [12] shows an overview of all the filters, applied on 
several classes of simulated off-axis sources. The fringe filter works 
well, as long as an accurate model of the source exists, and the re- 
ceived strength of the source does not change much in time. The 
low-pass filters in time and frequency direction together remove 
the off-axis source quite well. The projected iterative fringe filter 
in time direction can only attenuate the off-axis source moderately, 
even though it requires to know the direction to filter rather ac- 
curate. Application of the method on real data shows comparable 
results. 

4.1 Attenuation efficiency 

To test the level to which sources can be removed, we have sim- 
ulated a single 40 degrees off-axis source in an otherwise empty 
field, i.e., without any on-axis sources, and also without noise. We 
simulated a single 2.5 MHz band at 130 MHz with a standard 
WSRT configuration and compared the level of the sidelobes be- 
fore and after source filtering. The single fringe filter shows 40 dB 
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Figure 12. Simulated test sets with various types of off-axis sources that need to be removed. On its own, the single fringe filter removes the largest part of the 
source and its sidelobes, and only becomes inaccurate when the source changes in time or when the the model is inaccurate. The time and frequency low-pass 
filter complement each other, and together can remove everything outside a certain radius, if bandwidth allows. The projected fringe filter seems not to work 
very well - it removes a part of the source, but leaves artefacts in the image in every test case. 
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Figure 13. Position in the sky of B 1834 relative to other strong sources. 



of sidelobe attenuation on a constant source, but only attenuates up 
to 3 dB of a varying source, which provides a more realistic setting. 

The frequency direction low-pass filter can remove 10 dB of 
a source, which can be varying. Because the low-pass filters are 
less effective near the borders of the band and the start and end 
of the observation, we have tried flagging 5 per cent of the bor- 
der channels in the time frequency plane after filtering. This leads 
to 20 dB of attenuation. The low-pass filter in time direction does 
in theory not remove sidelobe noise in the direction of the source. 
However, in practice, it attenuates the RMS in areas around the 
phase centre by zero to 3 dB. This is because of a property of grid- 
ders: high fringe frequencies are mapped back to the area of in- 
terest, i.e., resampling causes aliasing effects. Therefore, removing 
the high frequencies before imaging lowers the noise as well. The 
RMS decrease in the radial direction due to low-pass filtering in 
time is around 25 dB. The large difference between attenuation of 
the tangential direction of time low-pass filtering versus the radial 
direction of frequency low-pass filtering is due to the limited band- 
width: in time, the observation contains lots of fringes which can 
be accurately filtered, but only a few fringes appear in frequency 
direction. 

In the same test, the projected fringe low-pass filter shows 
25 dB of attenuation around the phase centre. Finally, the projected 
iterative fringe filter attenuates only up to 3 dB. 

Obviously, these results are highly dependent on many param- 
eters, including the distance of the source to the phase centre, the 
amount of available bandwidth and its central frequency, the time 
and frequency resolutions and, for the single fringe filter, the speed 
of change of the source due to instrumental effects and the number 
and size of the interferometers. 



4.2 Low-pass filtering a WSRT observation 

We will now apply the filtering approaches on a WSRT dataset of 
the field centred on the radio galaxy B 1834-1-62. This field was ob- 
served to search for polarized emission in this double double radio 
galaxy ( [Schoenmakers et al.|2000j ) at very low frequencies. The ob- 



servations were done in August 2008 and lasted for 12 h. The back- 
end was configured to observe 8 frequency bands, each 2.5 MHz 
wide and covered in 512 spectral channels, at frequencies ranging 
from 115 to 163 MHz. Here we will only use data from the band 
at 139 MHz. The integration time was 10 s, the spectral resolution, 
after Hann tapering, was 10 kHz. At this time and spectral resolu- 
tion even sources more than 1 radian from the phase tracking centre 
were not significantly smeared. The field was affected by sidelobes 
from Cygnus A, Cassiopeia A and the Sun (for about 8 hours). An 
image of the locations of these sources, in the NCP projection of 
the whole sky suitable for the WSRT — an East- West array — is 
shown in Fig. [13] 

Although each of these three sources is not in the primary 
beam, each of them is strong enough to lower the dynamic range of 
the observation considerably because of their sidelobes in the im- 
age plane. It is hard to remove these sources from the observation, 
because they are in the sidelobes of the beam and, especially in the 
case of the Sun, they are complex and their apparent strength varies 
over time. Because we do not have accurate models of the sources 
in our observation, the low-pass filters are a good choice, and we 
will show that the low-pass filters prove to be quite effective for 
attenuating the three sources. 

Fig, pi] shows a single baseline of the B 1834 observation. The 
baseline used is RTO x RTA, a 1.3 km East- West baseline, and 
only data from a single 2.5 MHz band at 140 MHz was used. The 
displayed images correspond to several tens of degrees of the sky. 
The observation is limited by confusion noise of the Sun (right 
top corner, also aliased to the bottom), Cassiopeia A (left top) and 
Cygnus A (left bottom). The observation takes 12 hours and the (re- 
solved) contribution of the Sun moves through the image and sets 
halfway. Consequently, the Sun and its sidelobes would be very 
hard to remove with traditional methods. The two low-pass filters 
together remove the Sun down to the noise: in the filtered image, 
its peak value is 1 per cent of the original value. It is hard to re- 
move more, i.e., make the filter circle smaller, since only a small 
bandwidth is available. Because of this, the edge of the filter bor- 
der is blurred in the frequency filtering cases. For the same reason, 
Cassiopeia A should have been filtered but is removed only 95 per 
cent, and Cygnus A should not have been filtered, but is attenuated 
25 per cent. These errors occur because these sources are too close 
to the filter border. Other sources within the filter radius have been 
attenuated less than 1 per cent. 

The application of the low-pass filters on this baseline shows 
the practical effectiveness of the filters: filtering in time direction 
removes the tangential components of the sources, while the fre- 
quency direction removes the radial components. The frequency 
filter is not as accurate as the time filter, because of the limited 
2.5 MHz bandwidth available. This causes the circular "filtered" 
area not to have a sharp edge that a perfect sine function would 
produce. Instead, the edge is somewhat blurred. As a consequence, 
a part of Cassiopeia A has been removed, although it did not exceed 
the theoretical cutting frequency. 

In Fig. [15] a shorter baseline was processed with the filtering 
techniques. Baseline RTO x RT2 was used, which is only 288 me- 
ters long. Because of the combination of a short baseline and the 
small available frequency bandwidth, the frequency filter is only 
able to filter out 80 per cent of Cassiopeia A on this baseline. The 
Sun is still successfully attenuated over 99 per cent, up to the noise. 
Cygnus A is 10 per cent attenuated. No other sources in the area 
of interest have been visibly attenuated. Because the off-axis side- 
lobe noise RMS is around 10 per cent of the peak of strong on-axis 
sources in the area of interest, one can conclude from this image 
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Figure 14. Application of the low-pass filters on a single 1.3 km baseline of an actual WSRT observation of the B1834 area, observed partially in daytime. 
Frequency filtering removes the Sun down to the noise, including its sidelobes in the area of interest. The filter is less effective near the circular filter edge. 
The rings are aliasing effects. 
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Figure 15. Application of low-pass filters in both directions as in Fig. [14] but on a shorter baseline of 288 meters. The Sun is successfully attenuated, but the 
filter has been less effective on Cygnus A and Cassiopeia A. 



Table 1. Fringe speed in time and frequency directions as a function of 
scale, looking at zenith with a 1 km baseline. 
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only that the on-axis sources have been preserved for at least 90 per 
cent. 

As discussed, the filter frequency scales linearly with the base- 
line size: on long baselines, the fringe speed of sources is fast in 
both the frequency direction and the time direction. On short base- 
lines, a source might cause only a few fringes or less in the fre- 
quency direction. It is therefore more difficult to filter short base- 
lines, and Fig. [15] visualizes this problem. While the tangential con- 
tribution of Cygnus A has been removed effectively in the figure, 
only a small part of its radial contribution has been removed. The 
filter was able to remove the Sun because it is further away. On 
very short baselines, the real and imaginary components produced 
by a source are almost constant, and applying a low-pass filter in 
frequency direction on such a baseline will perform similar to av- 
eraging the frequency channels together. In such cases, the filter 
will not affect the astronomical data, but only average the noise 
out. If the fringe speed does not exceed the filtering frequency suf- 
ficiently on all baselines, the source will appear in the shorter base- 
lines, hence the large scale structures of the source sidelobes will 
remain. In general, the combination of bandwidth, filter area and 
baseline length define the success of the frequency filter. Table [T] 
shows a few configurations and their corresponding fringe speed 
for a particular baseline size and distance to the phase centre. 

In Fig. [16] all baselines were imaged together. The unfiltered 
Stokes I image is quite severely affected by sidelobes coming from 
off-axis sources. Moreover, because the off-axis sources come in 
through the far side of the primary beam, they appear in the polar- 
ized images as well. After filtering, the confusion noise is reduced 
significantly. Depending on which empty region is selected to cal- 
culate the RMS over, the noise has gone down by a factor of 1.5 
to 2 in Stokes I, while the polarized images show a factor of 2 to 3 



decrease in noise. Because the short baselines could not be filtered 
correctly in the frequency direction due to the limited bandwidth, 
the low-frequency components of the sidelobes remain. With suffi- 
cient bandwidth, such as for LOFAR, the results will be even more 
significant. CLEANing the images of Fig. [T6]removes some of the 
bright sources in the centre, but the strong sources in the sidelobes 
can not be removed by CLEANing. As one can expect, the CLEAN 
algorithm is able to CLEAN deeper and find more sources in the fil- 
tered image. 

Another less obvious effect of the filter is suppression of ghost 
sources that are caused by aliasing of the off-axis sources. When 
looking at Fig. [16] it appears that there is one strong polarized 
source near the centre of the field. However, when performing the 
low-pass filters, the source disappears. The reason for this is that 
the source is not a real source, but a low frequency projection of 
an off-axis source: a ghost. A zoom in on this ghost as in Fig. [Tt] 
shows that the ghost is also present in Stokes I. This ghost is an 
aliasing artefact caused by the gridding in the imager. It appears 
as a normal source and contains regular sidelobes, as can be seen 
in Fig. [16] Low-pass filtering in time and frequency attenuates the 
ghost, as will any other method that attenuates the original off-axis 
source. The aliased ghost is caused by baselines which are gridded 
just below the Nyquist rate of the source. If the source is sampled 
correctly, its ghost will not appear at all. On the other hand, if the 
source is badly undersampled, its contribution will average out. 

4.3 Dealing with flagged samples 

A complicating factor for low-pass filtering the time-frequency do- 
main is the fact that the time-frequency plane contains flagged data 
due to RFI contamination. This has to be taken into account before 
convolving the data with a sine function. To solve the problem, 
we will mimic how flags are handled during other stages of reduc- 
tion. Two techniques for solving flagged samples are commonly 
used. The first is to set flagged samples to zero and account for 
the missing samples when deconvolving. The second is, if the sam- 
ples are flagged before either correlation, further averaging or grid- 
ding, to only average over unflagged samples. The latter is similar 
to linear interpolation of the flagged samples, albeit the uv-position 
should be changed slightly because of the change of the centroid, to 
prevent bandwidth or time smearing. Before correlation or at high 
time and frequency resolutions, the difference between neighbour- 
ing samples is small enough that the error due to linear interpolation 
is small. 



© 2012 RAS, MNRAS 000,[l]{T8] 



Post-correlation filtering techniques 13 

"n +70°p — \ — ^ — \ — \ — ^ — \ — \ — ^ — \ — \ — \ — ^ — \ — \ — r \ \ \ — ^ I 




0.0020 



19^^00™ 



18^30™ 



Figure 16. A WSRT observation of field B1834 at 140 MHz containing three strong off-axis sources (see Fig. [13). WSRT can observe eight bands with 
2.5 MHz bandwidth at this frequency, however, for this image, only one of the eight bands is used. The top and bottom figures show Stokes I and Stokes Q 
respectively. The left images are from the raw data, the right images shows the same data after low-pass filtering the set in both time and frequency directions. 
Even though the filter is limited by the small bandwidth, the suppression of the confusion noise of off-axis is significant. The effect is more detectable in 
the polarized images. Depending on which area is used for RMS calculation, the Stokes I and Q images show a noise reduction by a factor of 1.5-2 and 2-3 
respectively. Moreover, a ghost of one of the off-axis sources (Cyg A) is strongly attenuated (see Fig.[T7). 
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Figure 17. Enlargement of the central area of Fig. [16] Aliasing of off-axis 
source causes a ghost in the primary field, which is attenuated by the low- 
pass filter. 



Since these methods have shown sufficient accuracy in prac- 
tice, we have used a similar linear interpolation scheme: the data 
is interpolated by performing a Gaussian convolution on the un- 
flagged data. The flagged samples in the original image are subse- 
quently replaced with values from the convolved image. The result 
of this procedure on one of the WSRT B8134 set is given in Fig. [18] 
Normally, only data that are not flagged are used for imaging. For 
Fig. [18] this is the data from panel |(b)| To be able to filter the set, 
the flagged samples are interpolated as in panel [(c)] Tests using 
all baselines of the WSRT B1834 set show that the difference be- 
tween imaging of the flagged set and the interpolated set in which 
all samples are used are small, as sources in the area of interest are 
changed less than 1 per cent. After low-pass filtering, we reapply 
the old flags. The rationale for this is to make sure that ranges that 
contain RFI are not used during further reduction, and the interpo- 
lated data is only used for filtering. 



4.4 Computational requirements 

For filtering the observation of B 1 834, we have used a regular desk- 
top with a dual core Intel Core2 CPU running at 2.13 GHz and 
2 GB of memory. Filtering the measurement set to create Fig.[T6]in 
time and frequency direction, including interpolating the RFI sam- 
ples, takes on the order of an hour on this machine, while we have 
been performing the filtering step with a non-optimized proof-of- 
concept script. This time is comparable with the time it takes to 
image the data set with the Iwimageij^that was used to create the 



^ The Iwimager or Light Weight Imager is part of the casarest program, 
a subpackage of the Common Astronomy Software Applications packai 
( |http : //casa . nrao . edu7 > 



Lge 



images. The measurement set contains 91 baselines with 4 polar- 
izations, 4300 time steps and 512 channels, and is 8 gigabytes in 
size. The 10 takes about 15 per cent of the time. Hence, the com- 
putational requirements for filtering are not excessive. The method 
performs around an order of magnitude faster than demixed peeling 
as implemented in the LOFAR pipeline. 

One complicating factor is that observations with a large num- 
ber of frequency channels are often split up in many (sub-)bands. 
This is for example the case for LOFAR observations. Since the 
total data can become large, the sub-sequences are divided over 
several nodes on a cluster. Efficient synchronisation of the data be- 
tween the nodes is not trivial, but by using a few nodes concur- 
rently, we have been able to successfully filter a high resolution 
LOFAR observation within a few hours. 



5 DISCUSSION 

5.1 Comparison of filter methods 

The filters discussed were the single fringe filter (^, the low-pass 
filter (^3l]and ^34]) and the projected fringe filters (5T2]and p3] l. 

The single fringe filter as proposed by Athreya and the intro- 
duced projected fringe filter can be applied before ionospheric cal- 
ibration. We have shown that the single fringe filter is acceptable 
accurate for removing stable RFI sources, as long as the source to 
be removed is strong and reasonably constant. The filter should in- 
clude the change in fringe frequency within the window as in Eq ([5]) 
for maximum accuracy. We do not observe stable, broadband RFI 
in LOFAR or WSRT that can be dealt with this method. To remove 
off-axis sources with the single fringe filter, an accurate model 
of the source is needed. In practical situations with non-constant 
sources, the fitting error exceeds 10 per cent and is therefore highly 
inaccurate in comparison to common ways to remove sources. It is 
therefore too inaccurate to be useful for off-axis source fitting. 

One of the reasons for a projected fringe filter to be useful 
is that it requires no model, except for a direction to filter to- 
wards. However, the iterative projected fringe filter was shown not 
to be accurate enough and will in general remove little more than 
50 per cent of the source's power. Hence, the iterative projected 
fringe filter provides little benefit when removing (celestial) off- 
axis sources. The projected fringe low-pass filter can remove a 
source completely, but has the unwanted effect of filtering part of 
the area of interest. However, this unwanted effect only occurs on a 
small part of the data; the further the source that is to be removed is 
from the area of interest, the smaller the area. A possible approach 
might therefore be to exclude the part of the data on which the 
fringe speed of the area of interest exceeds the filter speed. Subse- 
quently, the data can be calibrated on first order, and the calibration 
solutions can be extrapolated to the excluded data. The method is 
about an order of magnitude faster than peeling and demixed peel- 
ing. This approach needs further research. 

In contrast to the single and projected fringe filters, the use 
of the introduced low-pass filter lies mainly in removing off-axis 
sources. The low-pass filter in frequency will low-pass filter any 
structure in frequency direction, thus is probably only useful for 
multi-frequency synthesized imaging. In this situation, the fre- 
quency low-pass filter is an ideal tool to improve the signal to 
noise ratio of the area of interest after all calibration and subtrac- 
tion of modelled sources has taken place, because it attenuates ra- 
dial sidelobes. When structure in frequency direction is important, 
e.g., when performing spectrography, the method can not be ap- 
plied. The frequency low-pass filter is not necessarily limited to 
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Figure 18. A baseline for which the flags have been interpolated and filtered. Note that in panel |(c)| the RFI is still visible in the interpolated time frequency 
plot by eye, because those areas have a lower variance compared to the original. 



application after calibration. Because the phases and amplitudes are 
reasonably stable in frequency direction, it can be assumed that fil- 
tering in frequency direction will not remove information essential 
for calibration - as long as all modelled sources are within the un- 
filtered area in image plane. 

The low-pass filter in time might be less applicable for uncali- 
brated data, because it removes the high-frequency components in- 
troduced by quick phase or amplitude changes such as ionospheric 
changes. This problem is less relevant on longer baselines, because 
of the faster fringe speed: at \=2\ cm, a single degree off-axis 
source has a fringe duration of 17 min on a one kilometre baseline. 
The low-pass filter in time removes tangential sidelobes of off-axis 
sources, which implies that the sidelobe confusion noise in the area 
of interest is not directly attenuated. Nevertheless, this filter can be 
useful to reduce aliasing effects, such as removing an aliased ghost, 
where it is complementary to the frequency low-pass filter. 

In case the low-pass filter in the time or frequency direction 
is applied before calibration, one should make sure that the filter 
does not introduce baseline- specific errors (closure errors), because 
these might cause self-calibration to fail. Since all presented fil- 
ters are applied on individual baselines, this holds for all the filters. 
Although Athreya ("2009 ) argues that fringe fitting does not intro- 
duce closure errors, that only holds if the fit is perfectly accurate. 
It is unclear if this is generally true, because the accuracy of the 



fit is dependent on the fringe rate, and therefore baseline depen- 
dent. However, as long as the baseline-dependent error is small, 
self-calibration will benefit from the removal of the RFI source. 
We have not yet looked at calibrating filtered data, and this requires 
further research. 

For low-pass filtering we have only looked at applying a rect- 
angular windowed sine convolution (truncated sine), naturally im- 
posed due to the finite time/frequency range. Especially when the 
window is small in comparison to the size of a fringe rotation, non- 
rectangular windows might improve efficiency. Different window 
functions can provide different trade-offs between the sidelobes 
and the steepness of the filter edge in the image plane: functions 
with a small resolution bandwidth, such as the rectangular func- 
tion, will create a sharp edge that has ripples. On the other hand, 
functions with high sidelobe fall-off will create a smoother edge 
and will suppress the ripples better. An example of such a function 
is the Hann function ( Harris] 1978| ). 

It is harder to distinguish off-axis sources from on-axis 
sources in data that corresponds to specific areas in the i^^;-plane. 
The i^^;-areas for which this is the case, are areas at which the rota- 
tion angle of the i^^;-track is near the rotation angle of the off-axis 
source in the image plane. The reason for this is that the fringes 
of off-axis sources are slow in time direction in these i^v-areas, 
and cannot be distinguished from the slow fringes of sources near 
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Figure 19. The effect of over-averaging an observation in time direction, 
causing off-axis sources to be partly filtered in the long baselines. The left 
and right panel show respectively the uv-plmQ and the image domain. 



the phase centre. Any method that tries to separate off-axis sources 
from on-axis sources, will consequently be less accurate in these 
areas. Unfortunately, off-axis sources cause sidelobes that interfere 
with the phase centre in these same areas, hence it is important to 
accurately remove the off-axis sources from these areas in order to 
achieve high dynamic ranges. Using frequency bandwidth to distin- 
guish sources is necessary in these ranges. Many algorithms look at 
small bandwidths at a time. For example, most algorithms currently 
applied for LOFAR, such as demixed peeling or self-calibration, 
currently only use information from one or a few subbands at a 
time, while a LOFAR subband is only 200 KHz. To accurately sepa- 
rate off-axis sources with these algorithms, multiple subbands have 
to be combined together. 

Low-pass filtering is an implicit effect of integrating and av- 
eraging that occurs in the standard pipeline of interferometers. The 
implications of that will be discussed in the next section. 



5.2 Adverse effects of time and frequency averaging 

To reduce the data volume, the correlation coefficients are inte- 
grated over time directly after correlating, and are sometimes fur- 
ther time averaged, for example after a RFI flagging procedure has 
detected corrupted samples, as is the default for LOFAR. When 
imaging, the visibilities are once more averaged for gridding, to be 
able to apply a fast Fourier transform (FFT). Nyquist's theory states 
that the original signal can be reconstructed as long as the sampling 
frequency is at least two times the highest frequency. Hence, in or- 
der not to lose information, the sampling frequency in time and fre- 
quency should be twice the fringe frequency of the source given by 
respectively Equation ^ and ( p3] l. In this section we will discuss 
two side effects of averaging: (1) the effect of low-pass filtering and 
(2) the effect due to aliasing. 

When data is averaged, the highest frequency components can 
no longer be presented, and the data is therefore filtered of high 
frequencies. The corresponding side effects of time and frequency 
averaging can be deducted from the low-pass filtering results. Since 
the amount of averaging is normally independent of the baseline 
size, i.e., all baselines will be averaged equally, an off-axis source 
will only be filtered in long baselines. This has been sketched in 
Fig.[T9]for over-averaging in time direction. Over- averaging in fre- 
quency is similar, but in radial direction. For these reasons, the ef- 
fect of time and frequency averaging is baseline dependent and will 
contribute to closure errors. It is also a direction-dependent effect 
(DDE), since the distance of the source to the phase centre defines 
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Figure 20. Simulated effect of decreasing the time resolution with a factor 
of 3x and 25 x, on one single baseline with a single source, using two 
different methods: (A) averaging the data; and (B) low-pass filtering the 
data followed by nearest neighbour interpolation. 



its fringe speed, and therefore the amount of attenuation. Therefore, 
different positions on the sky will be differently attenuated. Finally, 
averaging in time and frequency directions only complement each 
other partly: even by over- averaging the time and frequency direc- 
tions significantly, the shorter baselines will still contain the source. 

In an over-averaged set, a source will appear at its original 
location, but the source is fully present only in a subset of the base- 
lines, which will cause it to have irregular sidelobes. Therefore, the 
source can not perfectly be removed with CLEAN, unless CLEAN 
is performed baseline by baseline or on smaller ranges of base- 
lines, which is harder due to the low signal-to-noise ratio and dirtier 
point spread function of fewer baselines. Direction-dependent cal- 
ibration might help, but directions that have been attenuated might 
still cause problems, e.g., in some antennas they will generate high 
gain solutions and therefore introduce noise. For these reasons, it 
is important to remove strong sources with fast fringe rates before 
time or frequency averaging in order to avoid their side lobes or 
added noise in the area of interest. This effect is most prominent 
in interferometric elements with a large field of view — a small 
element beam will naturally attenuate off-axis sources. 

A second side effect of averaging comes from the fact that 
averaging is not a perfect low-pass filter, and will cause aliasing 
effects of high frequencies in the lower fringe frequencies. This 
will increase the noise generated by off-axis sources because they 
will not be filtered as much as possible. Time averaging can also 
distort sources of interest and can even generate ghost sources if 
off-axis sources have not been removed beforehand, as was seen 
in Fig. [it] To remove these effects, a low-pass filter can be used 
before down sampling the visibilities. 

Fig. [20| shows the difference on a simulated observation be- 
tween these two methods of changing the time resolution: (A) av- 
eraging the data; and (B) low-pass filtering the data followed by 
nearest neighbour interpolation. The down sampling factor was 3 
and 25 for respectively the top and the bottom lines. The source 
is 30° from the phase centre and the simulated WSRT baseline is 
720 m, observing at 140 MHz and 62° declination. The maximum 
fringe speed is 30 Hz and the correlator integration time was 5 sec- 
onds. The figure demonstrates the non-ideal effect of averaging: 
sources which fringes beat with half the (new) Nyquist speed are 
attenuated up to 25 per cent, which does not occur in the filtered 
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case. Moreover, a source that beats faster than the Nyquist speed 
(bottom Hnes) is better attenuated with less ahased sidelobes by the 
filtering compared to averaging. The attenuation effect of averaging 
quickly decreases when the source is closer to the phase centre, but 
is still on the order of one percent at one degree when three times 
averaged. 

Time averaging has been used to average out RFI or other 
sources that have a high fringe rate. | Athreya| ( [2009] ) describes that 
RFI can be attenuated because of fringe stopping, although it is 
said that this is less effective at low frequencies. In Kogan & Owen| 
P010| ), the authors also describe averaging out RFI. As this article 
has shown, although the source itself is attenuated by averaging, 
and therefore helps calibration, we have shown it is better to per- 
form an explicit low-pass filter before downsampling. The fringe 
frequency expressed in fringes/sample is almost always higher in 
time direction compared to frequency direction. Hence, if one re- 
lies on fringe stopping and correlator averaging to suppress RFI or 
off-axis sources, the noise in the area of interest is still affected by 
the source, since time averaging does not remove sidelobes in the 
direction of the phase centre (Fig. [5]). 

Time and frequency averaging are also part of the peeling al- 
gorithm, where it is used to filter off-axis sources. From the per- 
spective of maximum attenuation, the baselines should be filtered 
with a filter size relative to the baseline length, instead of the de 
facto method of uniform averaging. This would suppress off-axis 
sources as much as possible, and equal in all baselines. However, 
care should be taken not to remove small temporal changes due 
to the ionosphere, that are needed for calibration. Fortunately, the 
ionosphere is typically stable in timescales of several minutes. 

It is well known that data averaging can cause tangential 
and radial smearing when averaging respectively the time and fre- 
quency dimension ( [Bridle & Schwab] |1999| ». The symptoms of 
bandwidth and time smearing can be intuitively explained with the 
results of this paper. As we have seen, the tangential and radial 
smearing happens because the longer baselines attenuate the source 
in a particular area of the i^^; -plane. 

By using appropriate resampling techniques such as described 
in the paper, instead of time or frequency averaging which is used 
de facto, it is possible to reduce a data set to a smaller size with 
fewer artefacts. This might especially become important for arrays 
with a large field of view, long baselines and high data rates, such 
as LOFAR and the SKA, or high frequency interferometers such as 
ALMA. In the future, it might be interesting to resample short base- 
lines to lower resolutions, as these baselines contain the slowest 
fringe rates. This could further reduce the size of a measurement. 
However, operations such as calibration currently can not handle 
irregularly sampled data. 

5.3 Relation to gridding 

To perform the two-dimensional FFT transform used for imaging 
the data, the uv-tracks are normally gridded onto a uniform grid. 
Like averaging, this has the side effect of low-pass filtering the data: 
the maximal fringe speed in any direction is defined by the grid 
resolution. In contrast to time or frequency averaging, the filter size 
is relative to the length of the baseline: long baselines are gridded 
with a finer resolution compared to short baselines. The filtering ef- 
fect of gridding is therefore equal to low-pass filtering in time and 
frequency: off-axis sources will be attenuated equally in all base- 
lines. The somewhat counter-intuitive fact is that coarsely gridding 
the uv-plane will suppress sidelobes of off-axis (RFI) sources in 
the image plane, and might increase the signal-to-noise in the area 



of interest. Furthermore, frequencies that can not be represented in 
the UV-plane, correspond with sources that fall outside the image 
plane. Therefore, imaging only the area of interest is an efficient 
way of filtering off-axis sources not of interest. 

Analogues to time and frequency averaging, the down- 
sampling before gridding is performed in a non-ideal way, for ex- 
ample by averaging From the conclusions in this work, we think 
aliasing effects are the reason why off-axis source that are not vis- 
ible in the image plane, still produce sidelobes when performing 
regular gridding. The side effects are similar to the effects presented 
in Fig.|20] which shows that sources both faster and slower than the 
Nyquist frequency are inefficiently attenuated. To solve this, the 
high fringe frequencies should be removed before gridding the data 
on the uv-plane. Again, the best way to do this is to low-pass filter 
the time and frequency directions before gridding. 

5.4 Relation to other techniques 

Although we have not tried combining this method with techniques 
such as (demixed) peeling, it is likely that the presented low-pass 
filters can complement these. There are two reasons for this: 

• During calibration, the solutions are constrained by solving 
for antenna gains and by using the measurement equation. Cali- 
bration normally assumes solution constantness over short time in- 
tervals and small bandwidths, and does not assume relations over 
the full time or frequency range. The low-pass filter uses the full 
time-frequency domain of a single baseline to disentangle sources. 
Therefore, it uses information that is complementary to the infor- 
mation used in standard removal techniques. 

• The low-pass filtering techniques are not model-based. On one 
hand, this allows direct and unbiased removal with less chance 
of inadvertently biasing towards an incorrect model, but on the 
other hand implies that there might not be enough data to sepa- 
rate sources in certain cases. Another difference with model based 
fitting, is that model based fitting can fail to converge due to an 
insufficient signal to noise level. Low-pass filtering is not limited 
by the signal to noise: due to the linearity of the Fourier transform, 
the result of low-pass filtering two time or frequency streams sepa- 
rately followed by averaging is equal to filtering the average of the 
two streams. 

Because the low-pass filtering techniques do not involve non- 
linear fitting, they are much faster. If the filter techniques can be 
used for first order removal of off-axis sources, they might save a 
considerable amount of processing time. Investigation of the rela- 
tion between the filter methods and other techniques will be the 
focus of further research. The LOFAR telescope provides a good 
test case for further research. Because of its large data volumes, its 
processing power is a considerable limitation, and it could poten- 
tially benefit a lot from faster source subtraction algorithms. 



6 CONCLUSIONS & OUTLOOK 

We have shown that several filters can be used on individual base- 
line correlations to attenuate both off-axis sources and RFI sources 
in radio observations, thereby increasing the dynamic range of the 
observation. Because of the high performance of the filters, they 

^ Most software packages do use more elaborate ways of sampling the data 
on the grid, for example by using prolate spheroidals. 
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are suitable for modern high-resolution observatories and can offer 
a complementary or alternative way to remove the sources. Espe- 
cially the low-pass filter in the time and frequency directions are 
attractive, as they effectively attenuate all sources and their side- 
lobes outside a certain radius from the phase centre. However, they 
work less well on shorter baselines, and need a considerable band- 
width to remove sources effectively. 

The next step is to further test the methods on other data, 
preferably with larger bandwidths, to see if the methods work in 
practice as well as in theory in other cases as well. Applying the 
filter on LOFAR data is attractive, because the off-axis source re- 
moval methods currently used are computationally intensive. With 
the large bandwidth of LOFAR, it would in theory be possible to, 
e.g., filter all sources outside 10 degrees even on baselines as short 
as 100 meters. 
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